Less is More: Improving the Speed and Prediction Power of Knowledge Tracing by Using Less Data

نویسندگان

  • Bahador B. Nooraei
  • Zachary A. Pardos
  • Neil T. Heffernan
  • Ryan Shaun Joazeiro de Baker
چکیده

Knowledge Tracing is perhaps the most widely used student model in the field of educational data mining. In this paper we report on the effects of using only a subset of data in training the Bayesian Network that represents this student model. The standard practice is to use all of the students’ data for a given skill to fit the model. We analyze two datasets; one from the Algebra Cognitive tutor and the other from the Genetics Cognitive tutor. We found that in both datasets, the difference in accuracy between using all the students' data versus only the most recent 15 data points of each student was not significantly different. Using only 15 responses however, resulted in an EM training time which was 15 times faster than using all data. This result suggests that the Knowledge Tracing model needs only a small range of data in order to learn reliable parameters. The implications of this result is a substantial savings in model training time that allows for more complex models to be fit or individualized models to be trained online.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

پیشگویی گام‌ـ بلند سرعت باد مبتنی بر مدل ترکیبی RNNGA

For proper and efficient utilization of wind power, the prediction of wind speed is very important. Wind is one of the main sources of energy in the world, but the wind turbines have a lack of reliability, continuity and homogeneity in power production. On the other hand, sudden changes of wind speed, lead to risk for wind turbine units health. Therefore, the prediction of wind speed for turbin...

متن کامل

Torque Ripple Reduction of Electrolytic Capacitor-less BLDC Motor Drive

Brushless DC motors called BLDC are used in many industrial and non-industrial applications today for reasons such as very high efficiency, easy control method and high reliability, and their use is increasingly used in mass production applications, especially home appliances. But these motors require the use of an electric drive, even in constant speed applications. Commercialization of these ...

متن کامل

Maximum Power Point Tracking of Wind Energy Conversion System using Fuzzy- Cuckoo Optimization Algorithm Strategy

Nowadays the position of the renewable energy is so important because of the environment pollution and the limitation of fossil fuels in the world. Energy can be generated more and more by the renewable sources, but the fossil fuels are non-renewable. One of the most important renewable sources is the wind energy. The wind energy is an appropriate alternative source of fossil fuel. The replacem...

متن کامل

Modeling the Operational Speed of Tangents and Curves in Four-lane Highways Based on Geometric and Roadside Factors

Operational speed is an index that represents drivers’ speeding behaviors and shows the comfort and safety levels they experience. Many models had been proposed to predict operational speed in tangent and curve segments of highways. Most of these works had used geometric, with few of them conducted using roadside variables to predict operational speed. Also, the operational speed study in multi...

متن کامل

Robust state estimation in power systems using pre-filtering measurement data

State estimation is the foundation of any control and decision making in power networks. The first requirement for a secure network is a precise and safe state estimator in order to make decisions based on accurate knowledge of the network status. This paper introduces a new estimator which is able to detect bad data with few calculations without need for repetitions and estimation residual cal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011